THE AUTOCORRELATION OF THE MÖBIUS FUNCTION
AND CHOWLA'S CONJECTURE FOR THE RATIONAL
FUNCTION FIELD

DAN CARMON AND ZEEV RUDNICK 



Abstract. We prove a function field version of Chowla's conjecture on
the autocorrelation of the Möbius function in the limit of a large finite
field.

1. Introduction

There is a well-known equivalence between the Riemann Hypothesis (RH)
and square-root cancellation in sums of the Möbius function $\mu(n)$, namely,
RH is equivalent to $\sum_{n \le N} \mu(n) = O(N^{1/2+o(1)})$. This sum measures the
correlation between $\mu(n)$ and the constant function. Recent studies have
explored the correlation between $\mu(n)$ and other sequences, see [2], [1].
Sarnak [8] showed that $\mu(n)$ does not correlate with any "deterministic"
(i.e., zero entropy) sequence, assuming an old conjecture of Chowla [3] on
the auto-correlation of the Möbius function, which asserts that given an
$r$-tuple of distinct integers $a_1, \dots, a_r$ and $\epsilon_j \in \{1,2\}$, not all even, then

$$ \lim_{N\to\infty} \frac{1}{N} \sum_{n \le N} \mu(n+a_1)^{\epsilon_1} \cdots \mu(n+a_r)^{\epsilon_r} = 0. \tag{1.1} $$

Note that the number of nonzero summands here, that is the number of
$n \le N$ for which $n + a_1, \dots, n + a_r$ are all square-free, is asymptotically
$c(a)N$, where $c(a) > 0$ if the numbers $a_1, \dots, a_r$ do not contain a complete
system of residues modulo $p^2$ for every prime $p$ [6], so that (1.1) is about
non-trivial cancellation in the sum.

Chowla's conjecture (1.1) seems intractable at this time, the only known
case being $r = 1$, where it is equivalent to the Prime Number Theorem.
Our goal in this note is to prove a function field version of Chowla's
conjecture.

Let $\mathbb{F}_q$ be a finite field of $q$ elements, and $\mathbb{F}_q[x]$ the polynomial ring over
$\mathbb{F}_q$. The Möbius function of a nonzero polynomial $F \in \mathbb{F}_q[x]$ is defined to be
$\mu(F) = (-1)^r$ if $F = cP_1 \cdots P_r$ with $0 \ne c \in \mathbb{F}_q$ and $P_1, \dots, P_r$ distinct
monic irreducible polynomials, and $\mu(F) = 0$ otherwise.



Date: December 12, 2012. 

The research leading to these results has received funding from the European Research 
Council under the European Union's Seventh Framework Programme (FP7/2007-2013) / 
ERC grant agreement n° 320755. 


Let $M_n \subset \mathbb{F}_q[x]$ be the set of monic polynomials of degree $n$ over $\mathbb{F}_q$,
which is of size $\#M_n = q^n$. The number of square-free polynomials in $M_n$
is, for $n > 1$, equal to $q^n - q^{n-1}$ [7, Chapter 2]. Hence, given $r$ distinct
polynomials $\alpha_1, \dots, \alpha_r \in \mathbb{F}_q[x]$, with $\deg \alpha_j < n$, the number of $F \in M_n$ for
which all of $F(x) + \alpha_j(x)$ are square-free is $q^n + O(rq^{n-1})$ as $q \to \infty$.
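As a sanity check on the count $q^n - q^{n-1}$, the following Python sketch enumerates monic polynomials over a prime field and tests square-freeness through $\gcd(F, F')$. It is only an illustration under simplifying assumptions: $q = p$ is taken to be prime, polynomials are stored as coefficient lists from the constant term up, and the helper names (`poly_gcd`, `is_squarefree`, `count_squarefree_monic`) are ours rather than notation from the text.

```python
from itertools import product

def trim(f):
    """Drop trailing zero coefficients (lists run from the constant term up)."""
    while f and f[-1] == 0:
        f = f[:-1]
    return f

def poly_mod(a, b, p):
    """Remainder of a divided by the nonzero polynomial b over F_p."""
    a = trim([c % p for c in a])
    inv_lead = pow(b[-1], -1, p)          # modular inverse (Python 3.8+)
    while len(a) >= len(b):
        factor = a[-1] * inv_lead % p
        shift = len(a) - len(b)
        for i, c in enumerate(b):
            a[shift + i] = (a[shift + i] - factor * c) % p
        a = trim(a)
    return a

def poly_gcd(a, b, p):
    """Euclidean algorithm in F_p[x]; the result is not normalized to be monic."""
    a, b = trim(list(a)), trim(list(b))
    while b:
        a, b = b, poly_mod(a, b, p)
    return a

def derivative(f, p):
    return trim([(i * c) % p for i, c in enumerate(f)][1:])

def is_squarefree(f, p):
    # Over a finite field, f is square-free iff gcd(f, f') is a nonzero constant.
    return len(poly_gcd(f, derivative(f, p), p)) == 1

def count_squarefree_monic(n, p):
    """Number of square-free monic polynomials of degree n over F_p."""
    return sum(is_squarefree(list(low) + [1], p)
               for low in product(range(p), repeat=n))
```

For instance, `count_squarefree_monic(2, 3)` returns 6, which is $3^2 - 3$.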

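For $r > 0$, distinct polynomials $\alpha_1, \dots, \alpha_r \in \mathbb{F}_q[x]$ with $\deg \alpha_j < n$, and
$\epsilon_j \in \{1,2\}$, not all even, set

$$ C(\alpha_1, \dots, \alpha_r; n) := \sum_{F \in M_n} \mu(F+\alpha_1)^{\epsilon_1} \cdots \mu(F+\alpha_r)^{\epsilon_r}. \tag{1.2} $$

For $r = 1$ and $n > 1$ we have $\sum_{F \in M_n} \mu(F) = 0$ [7, Chapter 2]. For $n = 1$ we
have $\mu(F) = -1$ and the sum equals $(-1)^{\sum_j \epsilon_j} q^n$. For $n > 1$, $r > 1$ we show: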

Theorem 1.1. Fix $r > 1$ and assume that $n > 1$ and $q$ is odd. Then for
any choice of distinct polynomials $\alpha_1, \dots, \alpha_r \in \mathbb{F}_q[x]$, with $\max \deg \alpha_j < n$,
and $\epsilon_i \in \{1,2\}$, not all even,

$$ |C(\alpha_1, \dots, \alpha_r; n)| \le 2rnq^{n-\frac12} + 3rn^2q^{n-1}. \tag{1.3} $$

Thus for fixed $n > 1$,

$$ \lim_{q\to\infty} \frac{1}{\#M_n} \sum_{F \in M_n} \mu(F+\alpha_1)^{\epsilon_1} \cdots \mu(F+\alpha_r)^{\epsilon_r} = 0 \tag{1.4} $$

under the assumptions of Theorem 1.1, giving an analogue of Chowla's
conjecture (1.1).
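To make the sum (1.2) concrete, here is a brute-force sketch for the smallest case covered by the theorem: $n = 2$, $r = 2$, $\alpha_1 = 0$, $\alpha_2 = 1$, $\epsilon_1 = \epsilon_2 = 1$, over a prime field $\mathbb{F}_p$ with $p$ odd. The Möbius function of a monic quadratic is read off from its number of roots in $\mathbb{F}_p$. The restriction to prime $q$ and the function names are assumptions made for illustration only.

```python
def mobius_deg2(b, c, p):
    """Mobius function of the monic quadratic x^2 + b x + c over F_p, p an odd prime."""
    roots = sum(1 for x in range(p) if (x * x + b * x + c) % p == 0)
    if roots == 2:
        return 1       # product of two distinct monic linear factors
    if roots == 1:
        return 0       # a repeated linear factor, so not square-free
    return -1          # no roots: irreducible

def correlation_deg2(p, shift=1):
    """C(0, shift; 2) with both exponents equal to 1, summed over all monic quadratics."""
    return sum(mobius_deg2(b, c, p) * mobius_deg2(b, (c + shift) % p, p)
               for b in range(p) for c in range(p))
```

In this special case the sum works out to $-p$, so the normalized correlation is $-1/p$, which tends to $0$ as $p$ grows, in line with (1.4).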

Our starting point is Pellet's formula, see e.g. [H Lemma 4.1], which
asserts that for the polynomial ring $\mathbb{F}_q[x]$ with $q$ odd (hence the restriction on
the parity of $q$ in Theorem 1.1), the Möbius function $\mu(F)$ can be computed
in terms of the discriminant $\operatorname{disc}(F)$ of $F(x)$ as

$$ \mu(F) = (-1)^{\deg F} \chi_2(\operatorname{disc}(F)) \tag{1.5} $$

where $\chi_2$ is the quadratic character of $\mathbb{F}_q$. That will allow us to express
$C(\alpha_1, \dots, \alpha_r; n)$ as a character sum and to estimate it.
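Pellet's formula (1.5) can be checked by hand in degree 2, where $\operatorname{disc}(x^2 + bx + c) = b^2 - 4c$ and $(-1)^{\deg F} = 1$. The sketch below compares the root-counting value of $\mu$ with $\chi_2$ of the discriminant for every monic quadratic over a prime field. It assumes $q = p$ is an odd prime and computes $\chi_2$ by Euler's criterion; the function names are hypothetical.

```python
def chi2(a, p):
    """Quadratic character of F_p (p an odd prime) via Euler's criterion: 0, 1 or -1."""
    a %= p
    if a == 0:
        return 0
    return 1 if pow(a, (p - 1) // 2, p) == 1 else -1

def mobius_deg2(b, c, p):
    """Mobius function of x^2 + b x + c over F_p, from the number of roots in F_p."""
    roots = sum(1 for x in range(p) if (x * x + b * x + c) % p == 0)
    return {2: 1, 1: 0, 0: -1}[roots]

def pellet_holds_deg2(p):
    """True if mu(F) = chi2(disc F) for every monic quadratic F over F_p."""
    return all(mobius_deg2(b, c, p) == chi2(b * b - 4 * c, p)
               for b in range(p) for c in range(p))
```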

2. Reduction to a counting problem 
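2.1. Character sums. We use Pellet's formula (1.5) to write

$$ C(\alpha_1, \dots, \alpha_r; n) = (-1)^{nr} \sum_{F \in M_n} \chi_2\bigl( \operatorname{disc}(F+\alpha_1)^{\epsilon_1} \cdots \operatorname{disc}(F+\alpha_r)^{\epsilon_r} \bigr). \tag{2.1} $$

Since $\operatorname{disc}(F)$ is polynomial in the coefficients of $F$, (2.1) is an $n$-dimensional
character sum; we will estimate it by trivially bounding all but one
variable. We single out the constant term $t := F(0)$ of $F \in M_n$ and write
$F(x) = f(x) + t$, with

$$ f(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1 x \tag{2.2} $$

and set

$$ D_f(t) := \operatorname{disc}(f(x) + t), \tag{2.3} $$

which is a polynomial in $t$. Writing $a = (a_1, \dots, a_{n-1})$ and summing over $t$
first, the triangle inequality gives

$$ |C(\alpha_1, \dots, \alpha_r; n)| \le \sum_{a \in \mathbb{F}_q^{n-1}} \Bigl| \sum_{t \in \mathbb{F}_q} \chi_2\bigl( D_{f+\alpha_1}(t)^{\epsilon_1} \cdots D_{f+\alpha_r}(t)^{\epsilon_r} \bigr) \Bigr|. \tag{2.4} $$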



We use Weil's theorem (the Riemann Hypothesis for curves over a finite
field), which implies that for a polynomial $P(t) \in \mathbb{F}_q[t]$ of positive degree,
which is not proportional to a square of another polynomial, we have [9, §2]

$$ \Bigl| \sum_{t \in \mathbb{F}_q} \chi_2(P(t)) \Bigr| \le (\deg P - 1) q^{1/2}, \qquad P(t) \ne cH^2(t). \tag{2.5} $$
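The bound (2.5) is easy to observe numerically. The sketch below evaluates the character sum for the sample polynomial $P(t) = t^3 + t + 1$, which has odd degree and so is not of the form $cH^2$, over a few prime fields. The choice of $P$, the restriction to prime $q$, and the function names are illustrative assumptions.

```python
import math

def chi2(a, p):
    """Quadratic character of F_p (p an odd prime) via Euler's criterion."""
    a %= p
    if a == 0:
        return 0
    return 1 if pow(a, (p - 1) // 2, p) == 1 else -1

def char_sum(coeffs, p):
    """Sum of chi2(P(t)) over t in F_p; coeffs run from the constant term up."""
    total = 0
    for t in range(p):
        value = 0
        for c in reversed(coeffs):        # Horner evaluation mod p
            value = (value * t + c) % p
        total += chi2(value, p)
    return total

def weil_bound(coeffs, p):
    """Right-hand side of (2.5): (deg P - 1) * sqrt(p)."""
    return (len(coeffs) - 2) * math.sqrt(p)

P = [1, 1, 0, 1]                          # 1 + t + t^3
```

Calling `abs(char_sum(P, p)) <= weil_bound(P, p)` for odd primes such as 7, 11, 101 gives `True`, as (2.5) predicts.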



For us, the relevant polynomial is $P(t) = D_{f+\alpha_1}(t)^{\epsilon_1} \cdots D_{f+\alpha_r}(t)^{\epsilon_r}$, which
has degree $\le 2r(n-1)$. Instead of requiring that it not be proportional to
a square, we impose the stronger requirement that for some $i$ with $\epsilon_i$ odd,
$D_{f+\alpha_i}(t)$ has positive degree and is square-free, and that for all $j \ne i$,
$D_{f+\alpha_i}(t)$ and $D_{f+\alpha_j}(t)$ are coprime. Under this requirement $P$ has a root of
odd multiplicity, so it cannot be of the form $cH^2$. We denote the set of coefficients
$a$ satisfying the stronger condition by $G_n$ (the "good" $a$'s, where we can
apply (2.5)), and let $G_n^c = \mathbb{F}_q^{n-1} \setminus G_n$ be the complement of $G_n$, where we
use the trivial bound $q$ on the character sum. Thus we deduce that we can
bound

$$ \begin{aligned} |C(\alpha_1, \dots, \alpha_r; n)| &\le \sum_{a \in G_n} (2r(n-1) - 1)\sqrt{q} + \sum_{a \in G_n^c} q \\ &\le (2r(n-1) - 1) q^{n-\frac12} + \#G_n^c \cdot q, \end{aligned} \tag{2.6} $$

where we have used the trivial bound $\#G_n \le q^{n-1}$ for the first part of the
sum. Theorem 1.1 will follow from

Proposition 2.1. Assume that $n > 1$ and $\max \deg \alpha_j < n$. Then

$$ \#G_n^c \le 3rn^2 q^{n-2}. $$
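The arithmetic that turns (2.6) and Proposition 2.1 into the bound (1.3) can be spelled out numerically. In the sketch below, `bound_from_2_6` inserts the estimate of Proposition 2.1 into the right-hand side of (2.6), and `bound_theorem_1_1` is the right-hand side of (1.3); both function names are ours. Dividing by $q^n$ shows the decay that gives (1.4).

```python
def bound_from_2_6(r, n, q):
    """Right-hand side of (2.6) with #G_n^c replaced by 3 r n^2 q^(n-2)."""
    return (2 * r * (n - 1) - 1) * q ** (n - 0.5) + 3 * r * n**2 * q ** (n - 2) * q

def bound_theorem_1_1(r, n, q):
    """Right-hand side of (1.3)."""
    return 2 * r * n * q ** (n - 0.5) + 3 * r * n**2 * q ** (n - 1)

def normalized_bound(r, n, q):
    """Bound (1.3) divided by #M_n = q^n, i.e. 2rn/sqrt(q) + 3rn^2/q."""
    return bound_theorem_1_1(r, n, q) / q**n
```

For fixed $r$ and $n$ the normalized bound decays like $q^{-1/2}$; for example `normalized_bound(2, 2, 10**6)` is below 0.01.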

2.2. Bounding $\#G_n^c$. We can write $G_n^c \subseteq A_n \cup B_n$ where:

• $A_n$ is the set of those $a \in \mathbb{F}_q^{n-1}$ for which $D_{f+\alpha_i}(t)$ is either a
constant or is not square-free, that is

$$ A_n = \{ a \in \mathbb{F}_q^{n-1} : D_{f+\alpha_i}(t) \text{ is constant or } \operatorname{disc}(D_{f+\alpha_i}) = 0 \} \tag{2.7} $$

• $B_n = \bigcup_{j \ne i} B(j)$ where $B(j)$ are those $a$'s for which $D_{f+\alpha_i}(t)$ and
$D_{f+\alpha_j}(t)$ have a common zero, which can be written as the vanishing
of their resultant

$$ B(j) = \{ a \in \mathbb{F}_q^{n-1} : \operatorname{Res}(D_{f+\alpha_i}(t), D_{f+\alpha_j}(t)) = 0 \} \tag{2.8} $$

What is crucial is that $A_n$ and each $B(j)$ are the zero sets of a polynomial
equation in the coefficients $a$; this is a key property of the discriminant and
the resultant.

We will need the following elementary but useful uniform upper bound
on the number of zeros of polynomials (cf. [§4, Lemma 3.1]):

Lemma 2.2. Let $h(X_1, \dots, X_m) \in \mathbb{F}_q[X_1, \dots, X_m]$ be a non-zero polynomial
of total degree at most $d$. Then the number of zeros of $h(X_1, \dots, X_m)$ in $\mathbb{F}_q^m$
is at most $dq^{m-1}$:

$$ \#\{ x \in \mathbb{F}_q^m : h(x) = 0 \} \le dq^{m-1}. \tag{2.9} $$
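Lemma 2.2 can be illustrated by brute force. The sketch below counts the zeros of the sample polynomial $h(X, Y) = X^2 + Y^2 - 1$ (so $d = 2$, $m = 2$) over a prime field and compares the count with $dq^{m-1} = 2p$. The choice of $h$ and the function names are illustrative assumptions, and $q = p$ is taken to be prime.

```python
from itertools import product

def count_zeros(h, m, p):
    """Number of points x in F_p^m with h(x) = 0 mod p, by exhaustive search."""
    return sum(1 for x in product(range(p), repeat=m) if h(*x) % p == 0)

def circle(x, y):
    """Sample polynomial of total degree d = 2 in m = 2 variables."""
    return x * x + y * y - 1
```

For $p = 7$ the count is 8, comfortably below the bound $2 \cdot 7 = 14$.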

As we will see below (see §2.3), the equation defining $A_n$ has total degree
$3(n-1)(n-2)$ in the coefficients $a_1, \dots, a_{n-1}$, and the equation defining
$B(j)$ has total degree $\le 3(n-1)^2$. Therefore, by Lemma 2.2, if we show
that the equations defining $A_n$, $B(j)$ are not identically zero, then we will
have proved

$$ \#A_n \le 3n^2 q^{n-2} \tag{2.10} $$

and

$$ \#B_n \le 3(r-1)n^2 q^{n-2}. \tag{2.11} $$

This immediately gives Proposition 2.1.

In order to show that a polynomial $h \in \mathbb{F}_q[X_1, \dots, X_m]$ is not
identically zero, we may instead consider it as a polynomial defined over $\overline{\mathbb{F}}_q$,
the algebraic closure of $\mathbb{F}_q$. In this context we can investigate the zero set
$Z_h = \{ a \in \overline{\mathbb{F}}_q^m : h(a) = 0 \}$, which is a subvariety of the affine space $\mathbb{A}^m$.
The polynomial $h$ is not identically zero if and only if $Z_h \ne \mathbb{A}^m$. This shall be
our main tool in the following sections.

2.3. Resultant and discriminant formulas. The discriminant $\operatorname{disc}(F)$ of
a polynomial $F(x) = a_n x^n + a_{n-1}x^{n-1} + \cdots + a_0$, $a_n \ne 0$, is given in terms of its
roots $r_1, \dots, r_n$ in the algebraic closure $\overline{\mathbb{F}}_q$ as $\operatorname{disc} F = a_n^{2n-2} \prod_{i<j} (r_i - r_j)^2$.
It is a homogeneous polynomial with integer coefficients in $a_0, \dots, a_n$, of
degree $2n-2$, and has degree $n-1$ as
a polynomial in $a_0$. Moreover, if $a_i$ is regarded as having degree $i$ then $\operatorname{disc}(F)$
is homogeneous of degree $n(n-1)$, that is, for every monomial $c_r \prod_i a_i^{r_i}$ in
$\operatorname{disc}(F)$,

$$ \sum_i i\, r_i = n(n-1). \tag{2.12} $$

The resultant of two polynomials $F(x) = a_n x^n + \cdots$, $G(x) = b_m x^m + \cdots$, of
degrees $n$ and $m$, is

$$ \operatorname{Res}(F, G) = a_n^m b_m^n \prod_{F(\rho)=0} \prod_{G(\eta)=0} (\rho - \eta). \tag{2.13} $$

It is a homogeneous polynomial of degree $m+n$ in the coefficients of $F$ and
$G$; in fact it is homogeneous of degree $m$ in $a_0, \dots, a_n$ and of degree $n$ in
$b_0, \dots, b_m$. Moreover, if $a_i$, $b_i$ are regarded as having degree $i$ then $\operatorname{Res}(F, G)$
is homogeneous of degree $mn$. We have

$$ \operatorname{Res}(F, G) = a_n^m \prod_{F(\rho)=0} G(\rho) = (-1)^{mn} b_m^n \prod_{G(\eta)=0} F(\eta). \tag{2.14} $$

Furthermore, the discriminant of a polynomial $F(x) = a_n x^n + \cdots + a_0$ of
degree $n$ may be computed in terms of the resultant as

$$ \operatorname{disc} F = (-1)^{n(n-1)/2} a_n^{n - \deg(F') - 2} \operatorname{Res}(F, F'). \tag{2.15} $$
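Formula (2.15) can be tested on a monic cubic with integer coefficients, where $a_n = 1$ and $\deg F' = 2$, so that $\operatorname{disc} F = -\operatorname{Res}(F, F')$. The sketch below computes the resultant as a Sylvester determinant, a standard description that is not used in the text and is brought in here only as an assumption for the illustration; the function names are ours. The result can then be compared with the classical closed form of the cubic discriminant.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (fine for small matrices)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        if entry:
            minor = [row[:j] + row[j + 1:] for row in m[1:]]
            total += (-1) ** j * entry * det(minor)
    return total

def resultant(F, G):
    """Res(F, G) as a Sylvester determinant; coefficients run from the leading term down."""
    n, m = len(F) - 1, len(G) - 1
    rows = [[0] * i + F + [0] * (m - 1 - i) for i in range(m)]
    rows += [[0] * i + G + [0] * (n - 1 - i) for i in range(n)]
    return det(rows)

def disc_cubic_via_resultant(a, b, c):
    """disc(x^3 + a x^2 + b x + c) from (2.15) with a_n = 1, n = 3, deg F' = 2."""
    F = [1, a, b, c]
    dF = [3, 2 * a, b]
    return (-1) ** (3 * 2 // 2) * resultant(F, dF)

def disc_cubic_closed_form(a, b, c):
    """Classical closed form of the discriminant of a monic cubic."""
    return a*a*b*b - 4*b**3 - 4*a**3*c + 18*a*b*c - 27*c*c
```

For example, `disc_cubic_via_resultant(0, -1, 0)` returns 4, the discriminant of $x^3 - x$.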

We apply this to compute the discriminant of $D_f(t) = \operatorname{disc}(f(x) + t)$,
$f(x) = x^n + a_{n-1}x^{n-1} + \cdots + a_1 x$. The discriminant $\operatorname{disc}(D_f(t))$ is a polynomial
in the coefficients $a_1, \dots, a_{n-1}$ of $f(x)$. We claim that the total degree of
$\operatorname{disc} D_f(t)$ is $3(n-1)(n-2)$. Indeed, $D_f(t) = \sum_{j=0}^{n-1} b_j t^j$ is a polynomial
of degree $n-1$ in $t$, and since it is homogeneous of degree $2(n-1)$ in
$t, a_1, \dots, a_{n-1}$ we find that the $b_j$ are polynomials of total degree $2(n-1) - j$ in
the $a_i$'s. Now $\operatorname{disc} D_f(t) = \sum_r c_r \prod_j b_j^{r_j}$ has total degree $2(n-1) - 2 = 2(n-2)$
in the $b_j$'s, that is $\sum_j r_j = 2(n-2)$, and by (2.12), $\sum_j j r_j = (n-1)(n-2)$.
Thus the total degree of $\operatorname{disc} D_f(t)$ in $a_1, \dots, a_{n-1}$ is

$$ \sum_j r_j \deg b_j = \sum_j r_j \bigl( 2(n-1) - j \bigr) = 2(n-1) \sum_j r_j - \sum_j j r_j = 2(n-1) \cdot 2(n-2) - (n-1)(n-2) = 3(n-1)(n-2) $$

as claimed.

Arguing similarly, one sees that the resultant $\operatorname{Res}(D_f(t), D_{f+\alpha}(t))$ has
total degree $3(n-1)^2$ in the coefficients $a_1, \dots, a_{n-1}$.

Assume $\gcd(q, n) = 1$. Then $f'(x) = nx^{n-1} + (n-1)a_{n-1}x^{n-2} + \cdots$ has
degree $n-1$ and by (2.15), (2.14) we find that

$$ D_f(t) = \operatorname{disc}_x(f(x) + t) = (-1)^{n(n-1)/2} n^n \prod_{f'(\rho)=0} (t + f(\rho)) \tag{2.16} $$

has degree $n-1$, with roots $-f(\rho)$ as $\rho$ runs over the $n-1$ roots of $f'(x)$.

In the case where $\gcd(q, n) > 1$, $f'(x) = -a_{n-1}x^{n-2} + \cdots$ has degree $n-2$
provided that $a_{n-1} \ne 0$, in which case by (2.15), (2.14) we have

$$ D_f(t) = \operatorname{disc}_x(f(x) + t) = (-1)^{n(n-1)/2} a_{n-1}^n \prod_{f'(\rho)=0} (t + f(\rho)), \tag{2.17} $$

which has degree $n-2$ and again has roots $-f(\rho)$ as $\rho$ runs over the $n-2$
roots of $f'(x)$.

3. Non-vanishing of the resultant

Proposition 3.1. Given a nonzero polynomial $\alpha \in \mathbb{F}_q[x]$ with $\deg \alpha < n$,
the map $a \mapsto \operatorname{Res}(D_f(t), D_{f+\alpha}(t))$ is not the zero polynomial, that is, the
polynomial function

$$ R(a) := \operatorname{Res}_t(D_f(t), D_{f+\alpha}(t)) \in \mathbb{F}_q[a] \tag{3.1} $$

is not identically zero.

Applying this to $\alpha = \alpha_j - \alpha_i$ for each $j \ne i$ will show that (2.11) holds.

Proof. Write $\alpha(x) = A_{n-1}x^{n-1} + \cdots + A_0 \in \mathbb{F}_q[x]$ with $\deg \alpha < n$.

Let $p$ be the characteristic of $\mathbb{F}_q$. Assume first that $p \nmid n$. Then by (2.16)
and the product formula for the resultant we find

$$ \operatorname{Res}(D_f, D_{f+\alpha}) = n^{2n(n-1)} \prod_{\substack{f'(\rho_1) = 0 \\ f'(\rho_2) + \alpha'(\rho_2) = 0}} \bigl( f(\rho_2) + \alpha(\rho_2) - f(\rho_1) \bigr). \tag{3.2} $$

If $p \mid n$ but $a_{n-1} \ne 0$ and $a_{n-1} + A_{n-1} \ne 0$, then by (2.17) and (2.14), we
find

$$ \operatorname{Res}(D_f, D_{f+\alpha}) = a_{n-1}^{n(n-2)} (a_{n-1} + A_{n-1})^{n(n-2)} \prod_{\substack{f'(\rho_1) = 0 \\ f'(\rho_2) + \alpha'(\rho_2) = 0}} \bigl( f(\rho_2) + \alpha(\rho_2) - f(\rho_1) \bigr). \tag{3.3} $$

Note that when $a_{n-1} = 0$ or $a_{n-1} + A_{n-1} = 0$, the resultant $\operatorname{Res}(D_f, D_{f+\alpha})$
is given by different polynomials than in the above case. However, this
might affect at most $2q^{n-2}$ "bad" $a$'s, which is a negligible amount, and the
conclusion of (2.11) remains valid.

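In both cases above, the "bad" $a$'s are those for which there are $\rho_1, \rho_2 \in \overline{\mathbb{F}}_q$
such that

$$ f'(\rho_1) = 0, \quad f'(\rho_2) = -\alpha'(\rho_2), \quad f(\rho_2) - f(\rho_1) = -\alpha(\rho_2). \tag{3.4} $$

This is a linear system of equations for $a \in \mathbb{A}^{n-1}$, which has the form

$$ M(\rho) a = b(\rho), \qquad \rho = (\rho_1, \rho_2) \tag{3.5} $$

for a suitable $3 \times (n-1)$ matrix $M(\rho)$ and vector $b(\rho) \in \mathbb{A}^3$. Thus over
$\overline{\mathbb{F}}_q$, the solutions of $R(a) = 0$ are precisely those $a \in \overline{\mathbb{F}}_q^{n-1}$ which satisfy the
system (3.5) for some $\rho \in \overline{\mathbb{F}}_q^2$.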

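We consider the affine variety (possibly reducible) defined by these
equations

$$ Z = \{ (\rho, a) \in \mathbb{A}^2 \times \mathbb{A}^{n-1} : M(\rho)a = b(\rho) \} \subset \mathbb{A}^2 \times \mathbb{A}^{n-1}. \tag{3.6} $$

Let $\phi : Z \to \mathbb{A}^{n-1}$ be the restriction to $Z$ of the projection $\mathbb{A}^2 \times \mathbb{A}^{n-1} \to \mathbb{A}^{n-1}$,
and $\pi : Z \to \mathbb{A}^2$ the restriction to $Z$ of the projection $\mathbb{A}^2 \times \mathbb{A}^{n-1} \to \mathbb{A}^2$:

$$ \mathbb{A}^2 \xleftarrow{\ \pi\ } Z \xrightarrow{\ \phi\ } \mathbb{A}^{n-1}, \qquad Z \subset \mathbb{A}^2 \times \mathbb{A}^{n-1}. \tag{3.7} $$

From the above, the solution set of $R(a) = 0$ is precisely $\phi(Z)$.

We will show that $Z$ has dimension $n-2$, and hence the dimension of
$\{R = 0\} = \phi(Z)$ cannot exceed $n-2$ and hence is not all of $\mathbb{A}^{n-1}$. Thus $R$
is not the zero polynomial, proving Proposition 3.1.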

To do so, we study the dimensions of the fibers $\pi^{-1}(\rho)$, which are affine
linear subspaces. We first assume that $n > 3$. In this case we will show that
$\pi(Z)$ is dense in $\mathbb{A}^2$ and generically, that is if $\rho_1 \ne \rho_2$, the fibers $\pi^{-1}(\rho)$ have
dimension $n-4$. Moreover there are at most $\deg \alpha$ non-generic fibers, each
of dimension $n-2$. This will show that $\dim Z = n-2$.

We rewrite the system (3.5) as:

$$ \begin{aligned}
\cdots + 3a_3\rho_1^2 + 2a_2\rho_1 + a_1 &= -n\rho_1^{n-1} \\
\cdots + 3a_3\rho_2^2 + 2a_2\rho_2 + a_1 &= -\alpha'(\rho_2) - n\rho_2^{n-1} \\
\cdots + a_3(\rho_2^3 - \rho_1^3) + a_2(\rho_2^2 - \rho_1^2) + a_1(\rho_2 - \rho_1) &= -\alpha(\rho_2) - (\rho_2^n - \rho_1^n)
\end{aligned} \tag{3.8} $$

To find the rank of the matrix $M(\rho)$ we compute that

$$ \det \begin{pmatrix} 3\rho_1^2 & 2\rho_1 & 1 \\ 3\rho_2^2 & 2\rho_2 & 1 \\ \rho_2^3 - \rho_1^3 & \rho_2^2 - \rho_1^2 & \rho_2 - \rho_1 \end{pmatrix} = (\rho_1 - \rho_2)^4 \tag{3.9} $$

and thus $M(\rho)$ has full rank 3 unless $\rho_1 = \rho_2$, and so the generic fibers
$\pi^{-1}(\rho)$ have dimension $n - 1 - 3 = n - 4$.
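The determinant identity (3.9) is a polynomial identity with integer coefficients, so it can be spot-checked at integer points. The sketch below builds the $3 \times 3$ block of $M(\rho)$ in the columns of $a_3, a_2, a_1$ and compares its determinant with $(\rho_1 - \rho_2)^4$; the function names are ours.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def leading_block(r1, r2):
    """Columns of a_3, a_2, a_1 in the system (3.8), as displayed in (3.9)."""
    return [
        [3 * r1**2, 2 * r1, 1],
        [3 * r2**2, 2 * r2, 1],
        [r2**3 - r1**3, r2**2 - r1**2, r2 - r1],
    ]
```

For example, `det3(leading_block(1, 3))` equals `(1 - 3) ** 4`, and the determinant vanishes exactly when the two arguments coincide.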

In the non-generic case $\rho_1 = \rho_2$, the matrix has rank one and we need
$\alpha'(\rho_2) = 0 = \alpha(\rho_2)$, which constrains us to have at most finitely many fibers
(the number bounded by $\deg \alpha / 2$), each of which has dimension $n - 1 - 1 = n - 2$.

Finally, the cases $n = 2, 3$ are handled similarly, except that the image
of the map $\pi : Z \to \mathbb{A}^2$ is no longer dense, due to algebraic conditions
constraining $\rho_1, \rho_2$. We omit the (tedious) details. □

4. Non-vanishing of the discriminant

We wish to show that the condition for being in $A_n$ is not always satisfied.
Without loss of generality, we can assume $\alpha_i = 0$. We first study a couple
of small degree cases.

For $n = 2$, $\operatorname{disc}(x^2 + ax + t) = a^2 - 4t$ is linear and hence has no repeated
roots (recall $q$ is odd), hence $A_n$ is empty. When $n = 3$, we have

$$ D_f(t) = \operatorname{disc}_x(x^3 + ax^2 + bx + t) = (a^2b^2 - 4b^3) + (18ab - 4a^3)t - 27t^2. \tag{4.1} $$

If $3 \mid q$ then $D_f(t) = (a^2b^2 - 4b^3) - 4a^3 t$ has degree one for $a \ne 0$; if $3 \nmid q$
then $D_f(t)$ has degree 2 and we compute that

$$ \operatorname{disc}_t \operatorname{disc}_x(x^3 + ax^2 + bx + t) = 16(a^2 - 3b)^3, \tag{4.2} $$

which is clearly not identically zero. So we may assume $n \ge 4$.
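The computation behind (4.2) can be reproduced from the coefficients in (4.1). With the usual convention $\operatorname{disc}(C_2 t^2 + C_1 t + C_0) = C_1^2 - 4C_0C_2$, which agrees with (2.15) for $n = 2$, the value comes out as $16(a^2 - 3b)^3$; the sign plays no role in the non-vanishing argument. The sketch below checks this at integer points, with function names that are ours.

```python
def D_coeffs(a, b):
    """Coefficients (C0, C1, C2) of D_f(t) = C0 + C1 t + C2 t^2 taken from (4.1)."""
    return a * a * b * b - 4 * b**3, 18 * a * b - 4 * a**3, -27

def disc_quadratic(c0, c1, c2):
    """Discriminant of c2 t^2 + c1 t + c0."""
    return c1 * c1 - 4 * c2 * c0
```

For $a = 0$, $b = -1$, that is $f(x) = x^3 - x$, this gives $D_f(t) = 4 - 27t^2$ and `disc_quadratic(*D_coeffs(0, -1))` returns 432, which is $16 \cdot 3^3$.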

4.1. Similarly to our approach in the previous section, it suffices to show
that outside a set of $a$'s of codimension at least one in the parameter space
$\mathbb{A}^{n-1}$, $D_f(t)$ is of positive degree and is square-free, that is, has nonzero
discriminant.

We conclude from (2.16), (2.17) that if $n \ge 4$ and $a$ is in the "bad" set
(but $a_{n-1} \ne 0$ if $\gcd(n, q) \ne 1$), then at least one of the following occurs:

• There is some $\rho \in \overline{\mathbb{F}}_q$ for which $f'(x)$ has a double zero at $x = \rho$,
that is, there is some $\rho \in \overline{\mathbb{F}}_q$ for which

$$ f'(\rho) = 0, \quad f''(\rho) = 0. \tag{4.3} $$

• There are two distinct $\rho_1 \ne \rho_2$ so that $f(\rho_1) = f(\rho_2)$ and so that
$f'(x)$ vanishes at both $x = \rho_1$ and $x = \rho_2$, that is

$$ f'(\rho_1) = 0, \quad f'(\rho_2) = 0, \quad f(\rho_1) = f(\rho_2). \tag{4.4} $$

We want to show that the set of $a \in \overline{\mathbb{F}}_q^{n-1}$ which solve at least one of (4.3),
(4.4) has dimension at most $n-2$.



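4.2. We first look at $f$ for which (4.3) happens. This gives a pair of
equations for $a \in \overline{\mathbb{F}}_q^{n-1}$:

$$ \begin{aligned}
\cdots + 2\rho a_2 + a_1 &= -n\rho^{n-1} \\
\cdots + 2a_2 + 0 &= -n(n-1)\rho^{n-2}
\end{aligned} \tag{4.5} $$

Defining

$$ W = \{ (\rho, a) \in \mathbb{A}^1 \times \mathbb{A}^{n-1} : \text{(4.3) holds} \} \tag{4.6} $$

we have a fibration $\pi$ of $W$ over the $\rho$ line $\mathbb{A}^1$ and a map $\phi : W \to \mathbb{A}^{n-1}$, the
restriction of the projection $\mathbb{A}^1 \times \mathbb{A}^{n-1} \to \mathbb{A}^{n-1}$,

$$ \mathbb{A}^1 \xleftarrow{\ \pi\ } W \xrightarrow{\ \phi\ } \mathbb{A}^{n-1}, \qquad W \subset \mathbb{A}^1 \times \mathbb{A}^{n-1}, \tag{4.7} $$

and the solutions of (4.3) are precisely $\phi(W)$.

The system (4.5) is nonsingular (rank 2) and hence $\pi : W \to \mathbb{A}^1$ is
surjective and for each $\rho$ the dimension of the solution set is $n - 1 - 2 = n - 3$.
We find that $\dim W = n - 2$ and hence $\dim \phi(W) \le n - 2$.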

4.3. Next we consider the system (4.4), which given $\rho_1 \ne \rho_2$ is a linear
system for $a \in \overline{\mathbb{F}}_q^{n-1}$, of the form

$$ \begin{aligned}
\cdots + 3\rho_1^2 a_3 + 2\rho_1 a_2 + a_1 &= -n\rho_1^{n-1} \\
\cdots + 3\rho_2^2 a_3 + 2\rho_2 a_2 + a_1 &= -n\rho_2^{n-1} \\
\cdots + (\rho_2^3 - \rho_1^3)a_3 + (\rho_2^2 - \rho_1^2)a_2 + (\rho_2 - \rho_1)a_1 &= -\rho_2^n + \rho_1^n
\end{aligned} \tag{4.8} $$

This system shares the matrix part of (3.8), and hence has rank 3 for
every $\rho_1 \ne \rho_2$. Thus the arguments of the previous section show that

$$ \{ a \in \mathbb{A}^{n-1} : \exists\, \rho_1 \ne \rho_2 \text{ s.t. (4.8) holds} \} \tag{4.9} $$

is of dimension at most $n-2$. This shows that (2.10) holds, thus concluding
the proof of Proposition 2.1.